EMMA: A novel Evaluation Metric for Morphological Analysis

Authors

  • Sebastian Spiegler
  • Christian Monson
Abstract

We present a novel Evaluation Metric for Morphological Analysis (EMMA) that is both linguistically appealing and empirically sound. EMMA uses a graph-based assignment algorithm, optimized via integer linear programming, to match the morphemes of predicted word analyses to the analyses of a morphologically rich answer key. This matching is especially necessary for unsupervised morphological analysis systems, which do not have access to linguistically motivated morpheme labels. Across three languages, the EMMA scores of 14 systems have a substantially greater positive correlation with mean average precision in an information retrieval (IR) task than do scores from the metric currently used by the Morpho Challenge (MC) competition series. We compute EMMA and MC metric scores for 93 separate system-language pairs from the 2007, 2008, and 2009 MC competitions, demonstrating that EMMA is not susceptible to two types of gaming that have plagued recent MC competitions: Ambiguity Hijacking and Shared Morpheme Padding. The EMMA evaluation script is publicly available from http://www.cs.bris.ac.uk/Research/MachineLearning/Morphology/Resources/.
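The matching idea in the abstract can be illustrated with a small sketch. This is not the EMMA script and not the authors' integer linear program: it is a simplified, brute-force one-to-one assignment over a toy example, and the function names (`cooccurrence`, `best_one_to_one`), the opaque labels `m1` to `m4`, and the toy answer key are all assumptions made for illustration. It pairs each predicted morpheme label with at most one answer-key morpheme so that the number of words in which the paired morphemes co-occur is maximal.

```python
from itertools import permutations


def cooccurrence(predicted, gold):
    """Count, for each (predicted, gold) morpheme pair, the words containing both."""
    counts = {}
    for word, pred_morphs in predicted.items():
        for p in pred_morphs:
            for g in gold.get(word, ()):
                counts[(p, g)] = counts.get((p, g), 0) + 1
    return counts


def best_one_to_one(predicted, gold):
    """Exhaustively search for the one-to-one mapping with the highest total count.

    Brute force is only feasible for toy inputs; it stands in here for the
    integer linear programming step mentioned in the abstract.
    """
    counts = cooccurrence(predicted, gold)
    pred_labels = sorted({p for morphs in predicted.values() for p in morphs})
    gold_labels = sorted({g for morphs in gold.values() for g in morphs})

    # Pad the shorter side with None so both sides have equal length.
    size = max(len(pred_labels), len(gold_labels))
    padded_pred = pred_labels + [None] * (size - len(pred_labels))
    padded_gold = gold_labels + [None] * (size - len(gold_labels))

    best_score, best_map = -1, {}
    for perm in permutations(padded_gold):
        score = sum(counts.get((p, g), 0) for p, g in zip(padded_pred, perm))
        if score > best_score:
            best_score = score
            best_map = {
                p: g
                for p, g in zip(padded_pred, perm)
                if p is not None and g is not None
            }
    return best_map, best_score


# Hypothetical output of an unsupervised system: labels carry no linguistic meaning.
predicted = {
    "walked": {"m1", "m2"},
    "talked": {"m3", "m2"},
    "walks": {"m1", "m4"},
}
# Hypothetical answer key with linguistically motivated labels.
gold = {
    "walked": {"walk", "+PAST"},
    "talked": {"talk", "+PAST"},
    "walks": {"walk", "+3SG"},
}

mapping, score = best_one_to_one(predicted, gold)
print(mapping, score)
```

In this toy case the opaque label `m2` is paired with `+PAST` because the two co-occur in both "walked" and "talked". Once such a mapping is fixed, predicted analyses can be relabeled and compared with the answer key directly.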

Similar resources

Machine learning for the analysis of morphologically complex languages

This thesis demonstrates that machine learning can be applied in different ways to automate the analysis of morphologically complex agglutinating languages. Firstly, the target language Zulu, an under-resourced indigenous language of South Africa, is characterised before presenting the UKWABELANA CORPUS. The morphological Zulu corpus has been semiautomatically compiled in close cooperation with...


The ecological factors influencing morphological characteristics of Alburnus atropatenae

The present study investigated the morphometric characteristics of two populations of Alburnus atropatenae in two rivers in northwestern Iran. In addition, the role of important ecological factors influencing morphometric features was studied. Ninety-three specimens were collected from the Siminehrood (39) and Zarinerood (54) Rivers using electrocution. The specimens were photographed using a digi...


Diagnosis of Diabetic Retinopathy Using Processing of Fundus Images and Morphological Techniques

Introduction: Diabetic retinopathy is the damaging effect of diabetes on retinal blood vessels, which can cause blindness when diagnosed late. Microaneurysms are early signs of the disease, and their early diagnosis promotes timely treatment and prevents disease progression. Since this disease is asymptomatic and can only be detected by ophthalmologists, diabetic patients should be tested regular...

A Multi-Metric Index for Hydrocarbons Source Apportionment

Several studies have been conducted to develop more accurate and precise indices for hydrocarbons source apportionment. The present study develops a new multi-metric index for this purpose. It measures the concentration of Poly Aromatic Hydrocarbons (PAHs) at six stations of well-known petrogenic origin, calculating Phe/An, Flu/Py, Chr/BaA, BaA/Chr, An/(An+Ph), Flu/(Fl...

Publication date: 2010